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Abstract 



C^ ' We introduce the concept of boundaries of a complex network as the set of nodes at distance larger 

than the mean distance from a given node in the network. We study the statistical properties of 
the boundaries nodes of complex networks. We find that for both Erdos-Renyi and scale-free model 
networks, as well as for several real networks, the boundaries have fractal properties. In particular, 
the number of boundaries nodes B follows a power-law probability density function which scales as 



^^ \ B ^. The clusters formed by the boundary nodes are fractals with a fractal dimension df ~ 2. We 

OO ' 

^^ ' present analytical and numerical evidence supporting these results for a broad class of networks. 

Our findings imply potential applications for epidemic spreading. 



Many complex networks are "small world" due to the very small average distance d 
between two randomly chosen nodes. Usually rf ~ In A^, where A^ is the number of nodes 
l|, y, y, y, [sl, l6| . Thus, starting from a randomly chosen node following the shortest path, 
one can reach any other node in a very small number of steps. This phenomenon is called 
"six degrees of separation" in social networks [4']. That is, for most pairs of randomly 
chosen people, the shortest "distance" between them is not more than six. Many random 
network models, such as Erdos-Renyi^etwork (ER) |l|, Watts-Strogatz network (WS) [5|] 



as well as many real networks, have been 



and random scale-free network (SF) |3|, Ig, [Zl, 
shown to possess this small-world property. 

Much attention has been devoted to the structural properties of networks within the 
average distance d from a given node. However, almost no attention has been given to 
nodes which are at distances greater than d from a given node. We define these nodes 
as the boundaries of the network and study the ensemble of boundaries. An interesting 
question is how many "friends of friends of friends etc..." one has at distance greater than 
the average distance d7 What is their probability distribution and what is the structure of 
the boundaries? The boundaries have an important role in several scenarios, such as in the 
spread of viruses or information in a human social network. If the virus (information) spreads 
from one node to all its nearest neighbors, and from them to all next nearest neighbors 
and further on until d, how many nodes do not get the virus (information), and what is 
their distribution with respect to the origin of the infection. Our results may explain why 
epidemics such as "black death" in medieval Europe stopped before reaching the entire 
population. 

In this Letter, we find theoretically and numerically that the nodes at the boundaries, 
which are of order A^, exhibit similar fractal features for many types of networks, including 
ER and SF models as well as several real networks. Song et al. [Qi] found that some networks 
have fractal properties while others do not. Here we show that almost all model and real 
networks including non-fractal networks have fractal features at their boundaries which are 
different from Song et al. 

Fig. [1] demonstrates our approach and analysis. For each node, we identify the nodes 
at distance (, from it as nodes in shell C.. We chose a random origin node and count the 
number of nodes Bi at shell L We see that i?i=10, B2=ll, i?3=13, etc... We estimate 
the average distance d ~ 2.9 by averaging the distances between all pairs of nodes. After 



removing nodes with i < d = 2.9, the network is fragmented into 12 clusters, with sizes 
S3={1, 1,2, 5, 1,3, 1,1, 8, 1,2, 3}. 




cluster of 8 nodes 



FIG. 1: (Color on line) Illustration of shells and clusters originating from a randomly chosen 
node, which is shown in the center (red). Its neighboring nodes are defined as shell 1, the nodes 
at distance i are defined as shell i. When removing all nodes with i < 3, the remaining network 
becomes fragmented into 12 clusters. 



We begin our study by simulating ER and SF networks, and later present analytical 
proofs. Fig. [2^ shows simulation results for the number of nodes Be reached from a randomly 
chosen origin node for an ER network. The results shown are for a single network realization 
of size A^ = 10^, with average degree (k) = 6 and d ~ 7.9 llj. For i < d, the cumulative 
distribution function, P{Bi), which is the probability that shell i has more than Bi nodes, 
decays exponentially for B^ > B^, where B^ is the maximum typical size of shell i 12|. 
However, for i > d, we observe a clear transition to a power law decay behavior, where 
P{Be) ~ 57^, with /3 ^ 1 and the pdf of Be is P{Bi) = dP{Be)/dBe ~ B^^. Thus, 
our results suggest a broad "scale-free" distribution for the number of nodes at distances 
larger than d. This power law behavior demonstrates the fractal nature of the boundaries 
of network, suggesting that there is no characteristic size and a broad range of sizes can 
appear in a shell at the boundaries. Further fractal features of the boundaries structure will 
be shown below. 

In SF networks, the degrees of the nodes, k, follow a power law distribution function 
q{k) ~ k~^, where the minimum degree of the network is chosen to be 2. Fig. |2]d shows, for 
SF networks with A = 2.5, similar power law results, P{Be) ~ B^ for £ > d as for ER, with 



a similar power /3 ~ 1. We find similar results also for A > 3 (not shown). 




FIG. 2: The cumulative distribution function, P{B()^ for two random network models: (a) ER 
network with A^ = 10^ nodes and {k) = 6, and (b) SF network with A^ = 10^ nodes and A = 2.5, 
and two real networks: (c) the High Energy Particle (HEP) physics citations network and (d) the 
Autonomous System (AS) Internet network. The shells with i > d are marked with their shell 
number. The thin lines from left to right represent shells i =1, 2... respectively, with i < d. 
For £ > d, P{Bi) follows a power-law distribution P{Bi) ~ B^, with /? ~ 1 (corresponding to 
P{Bi) ~ BJ for the pdf). The appearance of a power law decay only happens for I larger than 
d ^ 7.9 for ER and d ^ 4.7 for the SF network. The straight lines represent a slope of —1. 



To test how general is our finding, we also study several real networks (Figs. ^, 2d), in- 
cluding the High Energy Particle (HEP) physics citations network [ij] and the Autonomous 
System (AS) Internet network [lO|, ll5|. Our results suggest that the fractal properties of the 
boundaries appear also in both networks, with similar values of /3 ~ 1 for £ > rf |16|] . 

Next we ask how many nodes are on average at the boundaries? Are they a finite fraction 
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FIG. 3: (Color on line) (a) Normalized average number of nodes at shell i, {Bi)/N, as a function of 
£ — In A^/ln(fc) for ER network with < k >=6. For different N, the curves collapse, (b) k^ + l, which 
is {k'j)/{k£), as function of £ shown for both ER and SF network, (c) The probability distribution 
function P{Bf) in shells I < d for ER network. For small values of Bi, P{B() ~ S^, where ^ 
depends on the {k) of the network (Eq. (4)). (d) The fraction of nodes outside shell f + m, r^+m, as 
a function of r^ for ER network, where r^ is calculated for any possible L The (red) lines represent 
the theoretical iteration function (Eq. (6)). 

of A^, or less? In Fig. [Sti, we study the mean number {Bi) in shell £, and plot {Bi)/N as 
function of £ — lnA^/ln(fc) for different values of N for ER network. The term lnA^/ln(A;) 
represents the average distance d of the network [2]. We find that, for different values of N, 
the curves collapse, supporting a relation independent of network size A^. Since [Bi) /N is 
apparently constant and independent of N , it follows that (-B^) ~ A^, i.e., a finite fraction 
of A^ nodes appear at each shell including shells with i > d. We find similar behavior 
for SF network with A = 3.5 (not shown). The branching factor [13] of the network is 



k = {k'^)/{k) — 1, where the averages are calculated for the entire network. Similarly, we 
define ki = {kj)/{ke) — 1, where the averages are calculated only for nodes in shell i. Above 
the average distance, ki + 1 decreases with £ for both ER and SF networks (Fig. [Sb). Thus, 
at the shells where power law behavior of P{Bg) appears (Fig. [2]), the nodes have much lower 
ki + 1 compared with the entire network. The approach oi ki + 1 to 1 (ER network) and 2 
(SF network) is consistent with a critical behavior at the boundaries of the network |l3l |. 

Fig. [3b shows that P{Bi) ioi £ < d and small values of Bi increase as a power law, 
P{Bi) ~ B^, for ER network, where /i depends on k (supporting the theory developed 
below). We define the fraction of nodes outside shell m as r^ = 1 — (S^i Bi)/N. There 
exists a functional relation Eq. (6), which is independent of £, between any two r^ and r^+m 
{m = 1, 2, 3...), for ER network in Fig. [3li. Figs. [3b, 3d provide empirical evidences for the 
theory developed below. 

Next, we study the structural properties of the boundaries. Removing all nodes that 
are within a distance £ > d (not including shell £), the network will become fragmented 
into several clusters (see Fig. [1]). We denote the size of those clusters as g/, the number 
of clusters of size Si as n{si), and the average distance in the clusters as di [l7|. We find 
n{s) ~ s~^, with 6 ~ 3.0 (Figs, [lb and 4b). Similar relations are also found for ER and 
other real networks. The relation between the size of the clusters Si and their mean distance 
di is shown in Figs, [lb and 4d, for SF (A = 2.5) and HEP citations networks respectively. 
These plots suggest a power law relation, Si ~ d'^, with ip ^ 2. It indicates that the 
clusters at the boundaries are fractals with fractal dimension df = 2 a.s percolation clusters 
at criticality 18|. Note that, for very large clusters their average distances di decrease with 
size, suggesting that the largest clusters are not fractals. We find that the fractal dimension 
is df = (f ^ 2 also for ER, SF with A = 3.5 and several other real networks. 

Next we present analytical derivations supporting the above numerical results. We denote 

the degree distribution of a network as q{k). In infinitely large network we can neglect 

oops for £ < d and approximate the behavior of forming a network as a branching process 
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2l| . The probability of reaching a node with k outgoing links through a link is 
q{k) = (fc + l)q{k + l)/{k). The probability of number of branches equals to q{k). We define 
the generating function of q{k) as ^0(3;) = J2T=oli^)^^y ^^e generating function of q{k) as 
Gi{x) = Y.T=Qq{k)x^ = G'^{x)/{k). For ER networks we have Gq{x) = G'i(x) = e<^'>(^-i). 




FIG. 4: The number of clusters of sizes S£, n{si), as function of se after removing nodes within sheh 
i for: (a) SF network with N = 10^ and A = 2.5, (b) HEP citations network, and si as function of 
average distance di of the clusters for (c) SF network with N=10^ and A = 2.5, (d) HEP citations 



network. The relation between n{se) and sg is characterized by a power law, n{si) 
9 Ki 3. Also, S£ scales with d^ as si ~ df, with cp ^ 2. 



H > 



with 



The generating function for the number of nodes, Bm-, at the shell m is 22l |: 



G^{x) = Go(G'i(...(Gi(x)))) = G,{G^-\x)) 



(1) 



where Gi{Gi{...)) = G™ ^(x) is the result of applying Gi{x), m — 1 times. P{Bm), which 
is the probability distribution of B^, is the coefficient of x^" in the Taylor expansion of 

Gmix). 



For shells with large m but still much smaller than d, we exp ect 22l | that the number of 
nodes will increase by a factor of k. It is possible to show 20| that G^~^{x) converges to 
a function of the form /((I — x)k"^) for large m {m << d), and f{x) satisfies the Poincare 



functional relation: 

G^{f{y)) = f{y~k), (2) 

where y = 1 — x. The function form of f{y) can be uniquely determined from Eq. (2). 

The solution of G'i(/oo) = /oo gives the probability that a link is not connected to the 
giant component of the network by one of its ends 2l|]. It is known 201] that f{x) has an 
asymptotic functional form, f{y) = foo + ciy~^ + Q{y^). Expanding both sides of Eq. (2) we 
obtain: 

Gi(/oo) + G[{f^)ay-' = f^ + ak-'y-' + 0(/). (3) 

Since G'i(/oo) = /oo, we have 5 = -lnG[{foo)/ Ink. 

If g(l) = and g(2) ^ 0, from G,{f^) = f^^e have U = and G[iU) = Gl(Q)/{k) = 
2q{2)/{k). If g(2) = g(l) = (Bottcher case [2(J]), then 5 = oo, which indicates that f{y) 
has an exponential singularity. Therefore, networks with minimum degree fcm > 3 do not 
exhibit the following properties for ?7i << d, and therefore have no fractal boundaries. 

Applying Tauberian like theorems 20|, l23|] to f{y), which has a power-law behavior for 



y 



oo, Dubuc 



24l | concluded that the Taylor expansion coefficient of G^ 



[X 



P{B„ 



behaves as Bl^ with an exponential cutoff at B^ ~ k"^. When g(l) 7^ and g(2) 7^ 0, we 
have fi = 6 — 1 and when g(l) = and g(2) 7^ 0, we have fi = 26 — 1. Thus the distribution 
of the number of nodes in the shell m with m « d has a power law tail for small values of 

Bm'- 

P{B^) ~ 5^. (4) 

For ER network, Eq. (4) is supported by simulations for m < d in Fig. [3b. 

The above considerations are correct only for m < d, for which the depletion of nodes 
with large degree in the network is insignificant. 

In a large network, the shells with m » 1 behave almost deterministically and there 
exists a functional relation between any two shell m and shell n with n > m {a detailed 
proof will be given elsewhere): 



r„ = Go(Gr'"(Go-^(r„0)), 



(5) 



where r„ is the fraction of nodes outside shell n. It can also be shown that the branching 
factor in the r„ fraction of nodes is A;(r„) = uGq{u)/Gq{u), where u = G(7^(r„). 



For ER networks, Eq. (5) yields: 

r.+i = e^'^^^^-i) = S,-og(A;)r,^ (6) 

which is vahd for all possible L We test it in Fig. [311. 

When m « d and n » d, using the same considerations as before it can be shown 
that: 

Tn = [a~k{l - r„,)]-^'-^ + r^, (7) 

where Too = Go(/oo) is the fraction of nodes not belonging to the giant component of the 
network, a is a constant. 

Based on Eqs. (4) and (7), expressing r^ and r„ in terms of Bm and Bn, we find that for 
m « d and n » d, Bn ~ -B„^'^^. Using P{Bn)dBn = P{Bm)dBm, we obtain 

P{Bn) ~ 5-i-^/(^+^)-i/(^+i) = B~^, (8) 

supporting the numerical findings in Fig. [2J 

These results are rigorous when k exists and when the minimum degree km "^ 2. For 
SF networks with A < 3, A; diverges for N ^ oo. But for finite A^, k still exists. Thus the 
above results can also be applied to the case of A < 3. For both ER and SF networks with 
km > 3, the power law of P{Bn) with n » d cannot be observed, as we indeed confirm by 
simulations. 

Relating our problem with percolation theory, we can explain the simulation results of 
probability distribution of cluster size se. The cluster size distribution in percolation at some 
concentration p close to pc is determined by the formula [l3| : 

Pp{s > 5) ~ S-^+'exp{-S\p-p,\'/^) . (9) 

In the case of random networks the percolation threshold is given by pc = 1/k. In the 
exterior of the shell n {n » d), we can estimate \p — pd ~ {k{rn) — l)/k, where fc(r„) 
decreases and reaches the critical percolation value of 1. 

The cluster size distribution can be estimated by introducing a sharp exponential cutoff 
at s = ^; ~ |fc(r„) - 1|-^, so that P„(s > ^) ~ S-^+^P{S;^ > S), where P(^; > S) is the 
probability for a given shell to have S* > S. 

Since r„ — Too has a smooth power law distribution and k{r oo) < 1, the probability that 
k(rn)-l\ kS-"" = e is proportional to e. Thus P{S;^ > 5) ~ 5"'^ and P„(s > S) = S-^^^-" 



25|. Therefore the cluster size distribution follows n(s) ~ s 



<r+a) 



For ER networks and SF networks with A > 4, r = 2.5 and a = 0.5, the above derivations 
lead to n{s) ~ s~^. For SF networks with 2 < A < 4, r = (2A - 3)/(A - 2) and a = 



I A — 3|/(A — 2) 181]. Thus, for SF network with A > 3, there will be n^ ~ s^^. We conjecture 
Ug ~ s~^ even for 2 < A < 3, although in this case fc(r„) does not exist and the above 
derivations are not valid. Our numerical simulations support these results in Fig. H^, b. 

In summary, we find empirically and analytically that the boundaries of a broad class 
of complex networks including non-fractal networks |9(] have fractal features. Our findings 
can be applied to the study of epidemics. It imphes that a strong decay of the epidemic 
will happen in the boundaries of human network, due to the low degree of nodes. The 
fractal clusters at the boundaries, which are connected sparsely to the bulk, may explain 
why big breakout of epidemical disease (such as the "black death" in medieval Europe) 
would suddenly stop after affecting a large percentage of population. 

We thank ONR and Israel Science Foundation for financial support. 



[1] P. Erdos and A. Renyi, Publ. Math. 6, 290 (1959); Publ. Math. Inst. Hung. Acad. Sci. 5, 17 

(1960). 
[2] B. Bollobas, Random Graphs (Academic, London, 1985). 
[3] R. Albert and A.-L.Barabasi, Rev. Mod. Phys. 74, 47(2002). 
[4] S. Milgram, Psychol. Today 2, 60-67 (1967). 

[5] D. J. Watts and S. H. Strogatz, Nature (London) 393, 440 (1998). 
[6] R. Cohen and S. Havlin, Phys. Rev. Lett. 90, 058701 (2003). 
[7] S. N. Dorogovtsev and J. F. F. Mendes, Evolution of Networks: from Biological nets to the 

Internet and WWW (Oxford University Press, New York, 2003). 
[8] R. Pastor-Satorras and A. Vespignani, Evolution and Structure of the Internet: a statistical 

physics approach (Cambridge University Press, 2004). 
[9] C. Song et al, Nature (London) 433, 392 (2005); Nature Physics 2, 275 (2006). 
[10] S. Carmi et al, PNAS 104, 11150 (2007). 
[11] Different realizations yield similar results. In one realization, a certain fraction of nodes are 

randomly taken to be origin. The histogram is obtained from Bi belonging to different origin 

nodes. 



10 



[12] The behavior of the pdf of Bi for £ < d will be discussed later and is shown in Fig. [3]:. 
[13] R. Cohen et al, Phys. Rev. Lett. 85, 4626 (2000). 



[14] Derived from the HEP section of arxiv.org; http://vlado.fnif.uni-lj.si/pub/networks/data/hep-th/hep-th.htm 

(website of Pajek). 
[15] Y. Shavitt and E. Shir, DIMES - Letting the Internet Measure Itself, 



http://www.arxiv.org/abs/cs.NI/0506099 



[16] We also find similar results (not shown here) for other real networks. 

[17] Fractional shells with i + a {0 < a < 1) are used here by removing nodes within shell i and a 

fraction a of nodes at shell i. 
[18] R. Cohen et al., Phys. Rev. E 66, 036113 (2002); Chap. 4 in Handbook of graphs and networks, 

Eds. S. Bornholdt and H. G. Schuster (Wiley- VCH, 2002) 
[19] T. E. Harris, Ann. Math. Statist. 41, 474 (1948); The Theory of Branching Processes 

(Springer- Verlag, Berlin, 1963). 
[20] N. H. Bingham, J. Appl. Probab. 25A, 215 (1988). 
[21] L. A. Braunstein et al. Int. J. Bifurcation and Chaos 17, 2215 (2007). 
[22] M. E. J. Newman et al, Phys. Rev. E 64, 026118 (2001). 
[23] G. H. Weiss, Aspects and Applications of the Random Walk (North Holland Press, Amsterdam, 

1994). 
[24] S. Dubuc, Ann. Inst. Fourier 21, 171 (1971). 
[25] A.L.Barabasi et al., Phys. Rev. Lett. 76, 2192 (1996). 



11 



